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Abstract 

Alon and Krivelevich (SIAM J. Discrete Math. 15(2): 211-227 (2002)) show that if a graph 
is e-far from bipartite, then the subgraph induced by a random subset of 0(1/ e) vertices is 
bipartite with high probability. We conjecture that the induced subgraph is Cl(e)-far from 
bipartite with high probability. Gonen and Ron (RANDOM 2007) proved this conjecture in 
the case when the degrees of all vertices are at most 0(en). We give a more general proof that 
works for any d-regular (or almost d-regular) graph for arbitrary degree d. 

Assuming this conjecture, we prove that bipartiteness is testable with one-sided error in time 
0(1/ e c ), where c is a constant strictly smaller than two, improving upon the tester of Alon and 
Krivelevich. As it is known that non- adaptive testers for bipartiteness require f2(l/e 2 ) queries 
(Bogdanov and Trevisan, CCC 2004), our result shows, assuming the conjecture, that adaptivity 
helps in testing bipartiteness. 

1 Introduction 

A graph on n vertices is e-far from bipartite if it cannot be made bipartite even after removing an 
arbitrary set of ef^j of its edges. A randomized algorithm is a (one-sided) tester for bipartiteness 
with distance parameter e if, given oracle access to a graph 67, the algorithm always accepts if G 
is bipartite and rejects with probability at least 1/2 if G is e-far from bipartite. 

The problem of testing bipartiteness was among the first ones considered in the framework 
of property testing. In their work which introduces combinatorial property testing, Goldreich, 
Goldwasser and Ron |GGR98j give a tester for bipartiteness (of dense graphs) that runs in time 
■'■'')■ 

Later works [AFKS99, AS05] gave many examples of graph properties that are testable in time 
independent of the size of the graph. Alon et al. [AFNS06] gave a combinatorial characterization of 
all such graph properties. However, although all these properties have testers whose running time 
is independent of the graph size, the running time is very large in terms of the distance parameter 
e. Typically this dependence is a tower of exponentials of height polynomial in 1/e, and sometimes 
even worse. The super-exponential dependence on e in these analyses owes to applications of 
the regularity lemma. In contrast, there are relatively few properties that are testable in time 
polynomial in 1/e. Bipartiteness is perhaps the most natural example of such a property. 

In most cases, the design of property testing algorithms for graphs is relatively straightforward; 
it is the analysis that is difficult. This can be partially explained by a theorem of Goldreich and 
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Trevisan [GT03] which says that any tester that makes at most q queries can be simulated by a 
tester that chooses a subgraph on 0(q) vertices, makes all queries between the vertices and accepts 
if the induced subgraph satisfies a related property. Applying this canonical algorithm of Goldreich 
and Trevisan deteriorates the query complexity by at most a quadratic factor. In the case of 
superexponential bounds (typical in applications of the regularity lemma), a quadratic loss is not 
terribly relevant 

However, for bipartiteness testing, much sharper bounds on the query complexity are known. 
Alon and Krivelevich [AK02J show that if a graph is e-far from bipartite, then with high probability 
a random sample of 0(l/e) vertices contains an odd-length cycle. This gives a bipartiteness tester 
that makes 0(l/e 2 ) queries, improving the query complexity of the original tester of Goldreich, 
Goldwasser and Ron by a factor of about 1/e. 

The Alon-Krivelevich tester is non-adaptive in the sense that its query sequence is independent 
of the graph in question (and only depends on the parameter e). Bogdanov and Trevisan [BT04 
show that any non-adaptive tester for bipartiteness must make £1(1/ e 2 ) queries into the graph. For 
general testers, they only prove a query complexity lower bound of 0.(1 /e^ I 2 ). 

It is natural to ask whether the Alon-Krivelevich tester can be improved. Gonen and Ron [GR08 
observe that the known lower bounds apply even for the subclass of graphs where the degree of 
every vertex is bounded by 0(en). For this special class of graphs, they give an improved algorithm 
for testing bipartiteness that runs in time 0(l/e 3 / 2 ). 

Gonen and Ron also give a different type of algorithm that tests bipartiteness for any graph 
where all vertices have degree at least ken (1 < k < 1/e) with running 6(1 jke 1 ). Although not 
stated explicitly in their paper, when taken together their results imply that bipartiteness is testable 
in time 0(l/e 2 ~ c ) for some constant c < 2 for any <i-regular graph, where d can be arbitrary. (We 
explain this point below. In fact it is sufficient for the graph to be "almost <i-regular" in the sense 
that the degrees of most vertices are between d and Kd for some constant K.) 

Recently Goldreich and Ron [GR09] gave the first examples of graph properties for which an 
adaptive tester provably outperforms the best non-adaptive one by a polynomial factorH They also 
give a candidate collection of properties which is conjectured to approach the optimal quadratic 
speedup for adaptive testers. 

The question of testing bipartiteness has also been studied in the sparse graph model, where 
Goldreich and Ron |GR98] give a tester whose running time on an n- vertex graph is 0(y/n) for any 
fixed degree d and distance parameter e. Although their discussion focuses on the case of constant 
d and e, their analysis in fact works even when d and 1/e grow in terms of n (although the bounds 
get worse). For a precise statement of this fact, see Theorem [3j 

Kaufman, Krivelevich, and Ron [KKR04J gave upper and lower bounds for testing bipartiteness 
for general-degree graphs in a model that combines features of the sparse graph and dense graph 
models. 

lr The deterioration in running time could be somewhat worse, but again such losses are not very relevant when 
using the regularity lemma. 

2 Although Gonen and Ron already show that adaptivity helps for testing bipartiteness of small-degree graphs, 
their result does not translate into a graph property because their notion of "small degree" depends on the distance 
parameter e. 
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1.1 Our result 

Recall that Alon and Krivelevich |AK02_j prove that if a graph is e-far from bipartite, then a 
random sample of Cl(l/e) vertices will contain an odd-length cycle with probability at least 1/2. 
We conjecture the following generalization: 

Conjecture 1. If a graph G is e- far from bipartite, then with probability 1/2 the induced graph on 
a random sample of 0(1/ e) vertices is Cl(e) -far from bipartite. 

Gonen and Ron [GR08] prove this conjecture for graphs whose degrees are bounded by Ken for 
some constant K. We give a more general proof (Theorem Hip that works for the case where G is 
d-regular (and even when all its vertex degrees are between d and Kd for a constant K). 

Conjecture [T] is a purely combinatorial statement about graphs, which by itself does not yield 
anything better than the original, nonadaptive tester of Alon and Krivelevich for testing bipartite- 
ness in time 0(1 /e 2 ). Assuming this conjecture, we prove that there exists an adaptive tester for 
bipartiteness that outperforms the best non-adaptive one: 

Theorem 2. Assume Conjecture d Then bipartiteness is testable with one-sided error in time 
0(l/e c ), where c is a constant strictly less than 2. 



On the conjecture Besides our proof that the conjecture is true for regular graphs, more evi- 
dence in favor of it comes from work of Fischer and Newman [FN07]. It follows from their work 
that there exists a function / such that if G is e-far from bipartite, thaen with high probability a 
random subgraph on /(e)-vertices will be 0(e)-far from bipartite. We know that /(e) > 1/Ke for 
some absolute constant K > for instance from |BT04j . Conjecture 1 postulates that this bound 
is nearly tight, namely /(e) = 0(1/ e). 

We believe that Conjecture 1 is interesting in its own right. A similar phenomenon has been in- 
vestigated in the context of probabilistically checkable proofs (PCPs) [BSGH+Ooj IDR06] . Roughly, 
a PCP is robust if a proof which is e-far from a correct one must encode an assignment which is 
0(e)-far from a satisfying one. Conjecture 1 says, in some sense, that bipartiteness is robust under 
takingrandom subgraphs on 0(1/ e) vertices. 

Even if the conjecture turns out to be false, we believe that coming up with a counterexample 
to it would be enlightening in understanding the complexity of testing bipartiteness. 



Comparison with the work of Gonen and Ron Gonen and Ron give two algorithms for 
testing bipartiteness that apply to two different classes of graphs. Their first algorithm runs in time 
0(e 3//2 ), but only applies to graphs where all degrees are at most 0(en). Their second algorithm 
runs in time 0(1/ ke 2 ), but only applies to graphs where all degrees are at least ken. 

As we explain below, the two algorithms of Gonen and Ron are fundamentally different from 
one another and there is no a priori reason why we should expect a single algorithm for testing 
bipartiteness in time 0(l/e 2_c ) that works for all graphs, In this work we provide such an algorithm, 
and we show that if Conjecture 1 is true, our algorithm is correct on all inputs. 



1.2 The algorithms of Gonen and Ron 

Our proof is inspired by the algorithms proposed by Gonen and Ron [GR08] for testing bipartiteness 
in graphs with some restriction on the degrees of the vertices. Gonen and Ron propose two such 
algorithms: One for "low-degree" graphs and another one for "high-degree" graphs. 
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Before we explain the two algorithms, let us apply the conjectured generalization of the Alon- 
Krivelevich result. This generalization allows us to replace a graph on n vertices by one on 0(l/e) 
vertices, while roughly preserving the distance from bipartiteness. 

Applying this conjecture comes at no loss in generality (if we disregard polylogarithmic factors in 
1/e): If testing bipartiteness requires q(e) queries on graphs of size s = 0(l/e), then it requires the 
same number of queries on graphs on n veritces whenever n is a multiple of s. The counterexample 
of size s can be scaled up to size n by performing an en-blowup of it|fj It is not difficult to see that 
this blowup operation preserves the distance to bipartiteness □ 

After applying the conjecture, let us use n to denote the number of vertices in the induced 
subgraph. Then (with high probability) this graph G will be 0(l/n)-far from bipartite. For 
convenience, we will make a change in terminology: We say that a graph is m-removed from 
bipartite if one cannot make it bipartite even after removing m arbitrary edges. Then the graph 
G will be fi(n)-removed from bipartite. We have now reduced the task to designing a tester for 
bipartiteness in G that runs in time 0(n 2_c ) for some constant c. This algorithm only needs to 
outperform the trivial algorithm by a small factor in the running time. However, when G is far 
from bipartite, it is only guaranteed to have Cl(n) violating edges - about one per vertex. 

To explain the low-degree and high-degree algorithms, it is useful to keep in mind these two 
example graphs on n vertices that are f2(n)-far from bipartite: 

• G\: A constant-degree expander - for example, a collection of d random matchings for some 
constant d. 

• G2- A -^/n-blowup of an odd length cycle of length about y/n. 

The low degree algorithm When the maximum degree d of G is much smaller than n, we 
can test bipartiteness by emulating the Goldreich-Ron algorithm for bounded degree graphs. In 
the adjacency lists model with distance parameter S, the Goldreich-Ron algorithm runs in time 
O(Vn) • 1/S K , where K is some constant. If a graph G of maximum degree d is 0(n)-removed 
from bipartite in the adjacency matrix model, then G is 1/d ■ poly log (ra)-far from bipartite in the 
adjacency list model, and we have the following consequence. 

Theorem 3 (The Goldreich-Ron algorithm). Let G be a graph on n vertices maximum degree d. 
Suppose we are given an adjacency list representation of G. Then there is an algorithm that runs 
in time 0(y/n) ■ d K such that (1) if G is bipartite, the algorithm says "bipartite" and (2) if G is 
Q(n) -removed from bipartite, the algorithm outputs an odd-length cycle of G with probability 9/10. 

For intuition, we briefly outline the Goldreich-Ron algorithm: This algorithm chooses 0(1/5) 
random starting vertices v, then performs poly(log(n)/<5) • yfn random walks out of v of length 
poly(log(n)/5) and looks for pairs of random walks - one of even length, the other one of odd 
length - that start and end at the same vertex, thus revealing an odd-length cycle. For a graph like 
G\, this algorithm works well because the random walks mix within O(logra) steps and we expect 
two of them to collide after about \fn attempts. We expect the length of the walks to be about 

3 A k-blowup of a graph G is obtained by replacing every vertex of G by a group of k vertices and every edge of G 
by a complete k by k bipartite graph between the corresponding groups of vertices. 

4 One shows that the optimal partitions always assign the same color to all vertices within a group. For general 
graph properties, however, this is not always the case GKNR08 . 
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evenly divided between even and odd, so in the first collision an odd-length cycle is likely to be 
revealed H 

In the theorem, K is some universal constant which Goldreich and Ron do not attempt to 
optimize. Since every query in the adjacency list model can be emulated by n queries in the 
adjacency matrix model, this algorithm runs in time (5(n 3//2 ) / 'd K for graphs given by adjacency 
matrices. Notice that the Goldreich-Ron algorithm outperforms the trivial algorithm not only when 
d is constant, but also when d is a sufficiently small power of n. However, when the degree becomes 
very large - for example as in the graph G2 - then it is not difficult to see that the running time 
of the algorithm can easily become (l(n 2 ) in the adjacency matrix representation. 

The high degree algorithm Now assume the degrees of all the vertices are at least d. This 
algorithm chooses a random set S of vertices of size about 0((n/d) log n), queries all their neighbors 
for edges, makes another 0((n 2 /d) log n) random pair queries, and looks for an odd-length cycle. 
The running time of this algorithm is 0(n 2 /d). Instead of explaining why this algorithm works in 
general, let's look at the representative graph G2. Here, even in the first phase of the algorithm, 
we are likely to have sampled at least one vertex in every location of the cycle, so after querying 
all their neighbors we see at least one copy of the odd-length cycle. 

On the other hand, the high-degree algorithm is unlikely to work well on graphs like G\ with 
constant-degree vertices: After the first phase, the graph has seen many isolated vertices (and their 
neighbors) in the graph, and it is quite unlikely that a cycle will be revealed in the second phase. 

1.3 Our proof 

It follows from our discussion that there is some critical degree d = n a , where a is a constant, 
with the following property: If all vertices have degree less than d, then the low-degree algorithm 
outperforms the trivial one; and if all vertices have degree greater than d, then the high-degree 
algorithm outperforms the trivial one. So it is natural to try and combine both algorithms and get 
a single one that works for all graphs. 

Now consider a graph G that is m = 0(n)-removed from bipartite and has both low-degree and 
high-degree vertices. Intuitively, if G is m-removed from bipartite, it could be for three reasons: 
Either there are many violating edges in the high-degree part of G, or there are many violating 
edges in the low-degree part, or there are many violating edges between the two parts. (To illustrate 
the third case, consider a graph that has n/2 vertices of degree about -y/n that are connected by a 
blowup of a cycle of even length about y/n, and n/2 vertices of constant degree that are randomly 
connected to these high-degree vertices.) 

The high-degree algorithm lets us take care of the first case: If we run the high-degree algorithm 
on G and it detects no odd cycle, then we can be fairly confident that the high-degree part of G 
is close to bipartite. But in fact we prove that the high-degree algorithm does more: In the case 
the algorithm fails to detect an odd cycle, its queries reveal an approximate spanning forest of the 
high-degree part of G: That is, after removing m/2 edges, we know exactly which vertices in the 
high-degree part must be of the same color and which must be of different colors. We prove this in 
Section El 

5 This crude explanation misses many important ideas of the Goldreich-Ron algorithm, and we refer the reader to 
their paper GR08 for a complete presentation. 
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Once we know this approximate partition of the high-degree part, we can try to run the low- 
degree algorithm, but "short-cut" its random walks once they enter the high-degree part. For 
instance, suppose we have two low-degree vertices v, v' that connected, respectively, to high-degree 
vertices w,w'. If the queries of the high-degree algorithm revealed that w and w' are of different 
colors, then we can redirect a random walk that attempts to go from v to w directly into w' . 

To be a bit more formal, the process we just described is an emulation of the graph G by a new 
graph G . The graph G will contain only those vertices of G that have low degree, all the edges 
between them, and some extra edges that are meant to capture the connectivity information among 
the high-degree vertices of G. For the emulation to be correct, it must satisfy four requirements: 
(1) If G is bipartite, so is G; (2) if G is far from bipartite, so is G; (3) queries in G can be emulated 
efficiently using queries in G; and (4) even after adding the extra edges, all vertices in G remain 
of low degree. 

This emulation of G by G is explained in detail in Section [H Ignoring many relevant issues, 
the basic idea is to replace each "component" of the high-degree part of G revealed in the first 
phase by a random graph of (about) constant degree on the low-degree vertices it connects to which 
is consistent with the partition. Since a random graph is likely to be an expander, it turns out 
that this transformation preserves distance from bipartiteness. On the other hand, since edges in 
a random graph can be generated "on the fly" by sampling, the emulation can be performed in a 
query-efficient manner. 

For somewhat technical reasons, it will be easier for G to be an instance of a problem which 
is a generalization of bipartiteness. This problem, which we call an XOR game (borrowing from 
PCP terminology), is a binary constraint satisfaction problem with two kinds of edge-constraints: 
^-constraints and =-constraints which say that the colors of the endpoints should be different and 
equal, respectively. 

Finally, in Section [5] we show that an XOR game can be transformed into an ordinary instance 
of bipartiteness without significantly affecting the degree and the query complexity, while roughly 
preserving the distance. 

To summarize, the transformations we have described allow us to take a graph which can have 
vertices of arbitrary degrees and either locate an odd-length cycle in the high-degree graph, or 
transform the graph into a low-degree graph which roughly preserves distance from bipartiteness. 
All these transformations are done in a query-efficient way. We prove Theorem [2] in Section [6) 

1.4 Notation 

A graph G is m-removed from bipartite if even after removing any subset of at most m edges from 
G, the resulting graph is not bipartite. A graph is e-far from bipartite (in the dense graph model) 
if it is 5(2) -removed from bipartite. 

For a graph G and a subset of vertices S, we write Gg for the induced subgraph of G on S. 

Let G be a graph, and G a subgraph of G (possibly on a subset of the vertices). A spanning 
forest of G in G is a forest F of G such that every two vertices in the same component of G are 
connected by a path in F. When G = Gs, abusing terminology we will write "spanning forest of 
S in G." 

An XOR game is a graph where each edge is labeled by = or 7^. The game is satisfiable if there 
is a 2-coloring of the vertices that satisfies all the edge labels, and it is m-removed from satisfiable 
if it cannot be made satisfiable even after removing any set of at most m labels. 
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We use Eq(S,T) for the set of edges between S and T in G. We say G is an a-expander if 
\Eg(S, S)\ > a\S\ for every set S containing at most half the vertices. 

In the dense graph model we are given access to the adjacency matrix of the graph. In the 
sparse graph model we are given adjacency lists for every vertex. 

2 Splitting the vertices by degree 

We give a simple, sampling-based algorithm that splits the vertices of G into those whole degree 
is roughly smaller than d and those whose degree is roughly larger than d. This algorithm runs in 
time 0(n 2 /d). 

Algorithm Degree: For every vertex v, query (v,w) for 24n log n/d random ws. If 
more than 24 log n edges are detected, put v in H. Otherwise, put v in L. 

Claim 4. With probability at least 9/10, every vertex in H has degree at least d/2, and every vertex 
in L has degree at most 2d. 

The claim follows easily from a large deviation bound. Let d(v) be the degree of v. Each w 
is a neighbor of v independently at random with probability d(v)/n, so the expected number of 
neighbors among the queried ws is logn • d(v)/d. By a Chernoff bound we get that the number 
of neighbors is in the range (24 ± 12) log n{d(y)/d) with probability 1 — l/10n. If this event holds 
for a particular vertex, then a vertex of degree more than 2d cannot end up in L, and a vertex of 
degree less than d/2 cannot end up in H. The conclusion follows by a union bound over all v. 

3 The algorithm for high degree vertices 

We now describe the algorithm for the high-degree part of the graph. The goal of this algorithm 
is to either detect a violation of bipartiteness in this part of the graph, or find an approximate 
spanning forest. 

Algorithm High-degree: Choose s = lOnlogn/d random vertices v of G and query 
(v,w) for every w. Query an additional t = 40n 3 (log n) 2 /md random pairs (u, u'). 

Claim 5. Assume G is m-removed from bipartite. Let Q be the set of queries performed by Algo- 
rithm 1. Then with probability at least 9/10, either Q contains an odd cycle, or there is a set of 
at m/2 edges such that after removing these edges from G, we obtain a graph G" such that G" H is 
bipartite and Q contains a spanning forest of G" H in G. 

Let S be the set of n log n/d random vertices v. With probability 19/20, S dominates H: The 
probability that any fixed vertex of H is not dominated is at most (1 — d/2n) s < l/20n, so by a 
union bound S dominates H with probability 19/20. 

Now assume S dominates H. We split the analysis in two cases. First, consider what happens 
if Gh is m/4-removed from bipartite. Then we show that with probability 19/20, Q will contain 
an odd cycle. Consider any fixed bipartition of S. This partition induces a bipartition on H, which 
must have at least m/4 violating edges. The probability that one of the random pairs (u,u f ) hits a 
violating edge is then (1 — m/4n 2 )* < 2~ s /20. By a union bound over all partitions of S, it follows 
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that all of the induced partitions on H have a violating edge in the sample with probability at least 
19/20. If this is the case, Q must contain an odd cycle. 

Now let's assume Gh is m/4-close to bipartite. Delete at most m/4 violating edges to make it 
bipartite and call the resulting graph G' . We now focus attention on G' and G' H . 

Consider the following partition % = {Hi, . . . , H^} (h < s) of H induced by S: For every 
Vi 6 S, let Hi be the set of those vertices in H connected to Vi in G' but not to any of v±, . . . , Vi-\. 
We show that with probability 19/20, any cut C that preserves % (i.e. C does not split any set 
Hi) with at least m/4 [log n] edges in G' H contains an edge sampled in the second stage. Consider 
a fixed such cut C. The probability that no edge in C is picked in the second stage is at most 
(1 — m/n 2 logn)' < 2~ s /20. We conclude by taking a union bound over all such C, as there are at 
most 2 s of them. 

Now assume that every cut C that preserves T~L has either an edge in Q or at most m /4 [log n] 
edges in G' H . Look at any maximal forest inside Q n E(G'). The partition of H into components 
induced by this forest (i.e. two vertices in H are in the same component if they are connected in 
Q) is a coarsening of the partition %. Let's call it %' . So every cut C that preserves %' has at 
most m/4 logn edges in G' H . Now take any collection of [log 2 |^'|l such cuts that shatter T~L' (that 
is, every pair of sets in %' is partitioned by at least one of the cuts). Removing all the edges in all 
these cuts will disconnect every pair of sets of H' in G' H . The total number of removed edges is at 
most (m/4[logn~|) • [log 2 1'H'H ^ m/4. After removing all these edges we obtain a new graph G" 
such that Q becomes a spanning forest of G" u in G. Moreover, G" is obtained from G by removing 
at most m/4 + m/4 = m/2 edges. 

4 Eliminating the high degree vertices 

We come to the step of replacing the high-degree vertices of G with "equivalent" edges in the 
low-degree part of G. The idea is to replace each (sufficiently "well-connected") component in the 
high-degree part of the graph with an expander graph that has the same connectivity properties 
(namely, edges of the expander go only among vertices that are on opposite sides of the partition 
induced by the trees in the spanning forest). Let G' denote the new graph. The vertices of G' will 
be only those vertices of G which are in L. However, in addition to the edges in L, G' will have 
some additional edges which are meant to describe the connectivity within the high-degree part of 
G. 

We need to do this transformation of G into G' in a manner that preserves the soundness of the 
instance (a graph that is far from bipartite stays far from bipartite), but does so in a query-efficient 
manner. Specifically, we want to make sure that every query in the G' can be simulated by a small 
number of queries in the original graph. To do so, we make a distinction between two types of 
high-degree components in G: Those that have high connectivity to the low-degree part and those 
that don't. 

Our construction of G' is probabilistic: G' will be a random graph which is guaranteed to meet 
the specified construction and answer queries in G efficiently with high probability. 

If a component C of high-degree vertices has fewer than 0(|C|) edges going into the low-degree 
part, then we can safely ignore it because disconnecting all such components in a graph which is 
far from bipartite keeps the graph far from bipartite. 

If a component has more than this number of edges going to the low-degree part, then we want 
to emulate this component in G' by an expander graph among the vertices of the low-degree part 
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consistent with the high-degree component. However, we do not know the structure of the G in 
advance (since we want an algorithm that runs in sublinear time), so we need to implement this 
expander graph "on the fly" . 

To be more specific, for each high-degree component C and each edge going out of this high- 
degree component to a low-degree vertex v, we want to introduce an expander edge going out of 
v in G' . We will construct this expander at random: To emulate an edge going from a low-degree 
vertex v to a high-degree vertex w in some component C in G, we want to choose a random edge 
between some other w' £ C of the opposite color and some v ' £ L and connect v to v' . The key 
point is that since the cut between C and the low-degree vertices is dense, such an edge can be 
found by random sampling in a relatively query-efficient way. 

To summarize, we replace each high-degree component C of G which has more than about |C| 
edges going into the low-degree part by a random graph (of small degree) between its low-degree 
neighbors. Since a random graph is likely to be an expander, this step is likely to preserve distance 
from bipartiteness. On the other hand, since by assumption the cut between C and L is fairly 
dense, the step can be implemented in a query-efficient way. 

The actual proof includes some additional technicalities. For example, some extra work needs 
to be done in order to determine if a component C has a sparse cut or a dense cut to L. We do so 
by a preliminary sampling step. 

Also, there could be an imbalance between the number of vertices of the two colors within a 
component C, in which case the sampling of an exit edge may take too long. To resolve this, we 
will allow to have edges within G' between vertices of the same color, but we will mark those edges 
as =-edges (to indicate that the colors of the endpoints are equal and not different). This gives a 
generalized version of bipartiteness: A constraint satisfaction problem with two types of constraints 
which indicate the colors of the endpoints are equal or not equal. We will allow for these generalized 
constraints and replace them with ordinary (inequality) constraints later on. 

Specification of G' . Assume Gh is bipartite. Let C be the collection of connected components 
of Gh- We say G' is a sparse version of G if the following conditions hold: 

1. The set of vertices of G' is L. 

2. For every edge (v,w) in G with v,w in L, (v,w) is an 7^-edge of G' . 

3. Assume C £ C satisfies \Eg(C, L)\ > \C\m/3n. Let Z be an arbitrary A-regular 1/2-expander 
on \Eg(C, L)\ vertices. Each vertex of Z represents an edge (v,w), where v £ L and w £ C. 
Then we have the following edges in G': For every pair (v, w), (v',w'), where v,v' £ L, 
w,w' £ C, if (v,w), (v',w') is an edge in Z, then we put an edge between v and v' in G' . 
This is an =-edge if w and w' are at even distance in Gh, and a /-edge otherwise. 

4. Assume C £ C satisfies \Eg(C,L)\ < \C\m/Qn. For every edge (v, w) such that v £ L, w £ C, 
there is an =-loop (v,v) in G'. 

5. If \C\m/6n < \E G (C,L)\ < \C\m/3n, then both are allowed. 

Notice that if Gl has max-degree 2d, then G' has max-degree 4Sd. 

Claim 6. Assume Gh is bipartite and G' is a sparse version of G. Then (1) If G is bipartite, then 
G' is satisfiable and (2) if G is m-removed from bipartite then G' is m/3-removed from satisfiable. 
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Part (1) follows easily as any consistent bipartition of G is by construction a consistent assign- 
ment of G' . 

For (2) we prove the contrapositive. Assume we have a partition (S, S) of L that violates at 
most m/3 constraints of G' . Write the number of violated constraints as m^ + YlceC m Ci where L 
is the number of violated constraints within L and mp is the number of violated constraints owing 
to component C. 

We show (S, S) can be extended to a partition of L U H that violates at most m edges of G. We 
define the coloring on H as follows: Since Gh is bipartite, for every component C of Gh there are 
two possible partitions. Among the two, choose the one that violates the smaller number of edges 
between C and L. 

We now upper bound the number of violated edges in G with respect to this partition. Within L, 
the number of violated edges is exactly mi. Within H, by construction there are no violated edges. 
So it remains to bound the number of violated edges between L and C for every C G C. For those C 
in which the construction in step (4) was applied, we have introduced at most \Eg(C, L)\ < \C\m/3n 
violating edges. Even summing over all such components C this introduces at most m/3 new 
violating edges. 

It remains to bound the contribution of those C for which the expander construction (3) was 
applied. Let (Tc, Tc) be the partition of the vertices of Z induced by the coloring on H: The color 
of e G Z is determined by the color of its endpoint inside C. Let (Sc, Sc) be the partition of the 
vertices of Z induced by the coloring on L: Here the color of e G V{Z) is determined by the color 
of its endpoint inside L. Notice that a constraint (e, e') of Z is then violated if and only if the 
(Tc,Tc) and (Sc,Sc) of (e, e') are inconsistent, namely 



Therefore, a pair (e, e') violates a constraint of Z if and only if it cuts across the partition (Sc © 
Tc, Sc © Tc). Assume without loss of generality that Sc © Tc is the smaller of the two sets. By 
the expansion of Z, the number of violated constraints mc of G' is at least as large as \Sc © 
Tc|/2.But \Sc © Tc\ is exactly the number of edges between L and H whose endpoints are colored 
inconsistently. 

We summarize the total number of violated edges in G: We have mi such edges inside L, 
at most m/3 edges coming from type (4) components and at most 2mc edges for every type (3) 
component C. So the total number of violating edges is at most 



Implementation of a G'-oracle given oracle access to G. This is a randomized procedure 
Reduce which is a query-efficient implementation of G' and is a sparse version of G with high 
probability. This procedure has oracle access to G and direct access to a spanning forest F of Gh- 
For each connected component C of Gh, Reduce will have a variable sc that has value "sparse" 
(indicating that the \Ec(C,L)\ is small) or "dense" (indicating it is large). Initially, all these 
variables are uninitialized. 



\(S c (e) © Sc(e')) © (T c (e) © T c (e'))\ = 1. 



(Here A(e) = 1 if e G A, if e A.) We can rewrite this equation as 



\(S c (e) © T c {e)) © (S c (e') © T c (e'))\ = 1. 
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Without loss of generality we will assume \Eq(C, L)\ is an even number. This can be enforced, 
say, by making two copies of every edge in G. 

The idea of Reduce is to build (in an online manner) an expander Z on the dense components, 
while replacing the sparse components with loops. The expander Z will consist of A random perfect 
matchings. These matchings are constructed in an online manner. For every component C of Gh 
that has been declared dense, the implementation of G' keeps track of the partial perfect matchings 
corresponding to this component. (Initially, when sc is uninitialized the collection of matchings is 
empty. 

It will be more convenient to specify G' in the sparse graph model (via adjacency lists). Remem- 
ber G is provided to us in the dense graph model. A query of the type (v, i) should be answered 
by (w, =) (if v and w are connected by an =-edge) (w, /) (if v and w are connected by a /-edge) 
or _L (if v has fewer than i neighbors). 

Here is how a query (v, i) for the ith neighbor of v G L is answered in G'\ 

1. Query (v,w) in G for all w to find all neighbors of v in G. There are at most 4d of them 
(remember we doubled the number of edges). Make two lists: One that contains those 
neighbors in L, and the one that contains those neighbors in H. Let k be the number of 
neighbors in H. 

2. If i > kA, return (w, 7^), where w is the [i — fcA)-th neighbor of v on the L-list (or _L if it 
doesn't exist). 

3. If i < kA, let w be the [^/AJ-th neighbor of v in H and j = i mod A. 

4. If w belongs to a component C such that sc is uninitialized, repeat the following experiment 
for 100n|L| log n/m steps: Choose random vertices u G C and z G L and query if (u, z) is an 
edge in G. If the number of detected edges is less than 1001ogn/4 set sc = "sparse" (and go 
to step 5), otherwise set sc = "dense" (and go to step 6). 

5. If w belongs to a component C of H such that sc = "sparse", return (v,=) (Self-loops.) 

6. If w belongs to a component C of H such that sc = "dense" : If this edge was already visited, 
return the previous answer. Otherwise, repeat the following procedure until an output is 
produced: Choose random vertices u G C and z G L. If (n, z) is an edge in G: If the jth 
matching edge corresponding to (u,z) has not been visited yet, mark that the jth matching 
connects (v,w) and (u,z) (i.e., mark that (v,i) goes to u and mark that the corresponding 
edge out of u with index (order of w)A + j goes to v). Output (u, =) if w and u are connected 
by a path of even length in F, and (u, /) if they are connected by a path of odd length. 

The amortized query complexity of the qth query into G' is the total number of queries G' makes 
into G while answering its first q queries divided by q. 

Claim 7. (Amortized query complexity analysis) Assume m = Cl(n). Fix any sequence of queries 
to G' . With probability 9/10, the amortized query complexity of the last query into G' is 0(n). 

Claim 8. (Correctness of implementation) Assume Gh is bipartite and F is a spanning forest of 
H. Further assume A > 31ogn. With probability 9/10, G' is a sparse version of G. 
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We explain the somewhat unusual assumption on A below. To prove both of these claims, we 
start with the simple claim that with probability 19/20 procedure Reduce correctly identifies the 
sparse and dense cuts between L and H : 

Claim 9. With probability at least 19 / '20 , the following is true for all components C ': If\Ec(C,L)\ > 
\C\m/3n then sc = "dense" and if \Eq{C, L)\ < \C\m/6n then sc = "sparse". 

Proof. Let s = 100n|L| log n/m. Suppose \Eq(C, L)\ > \C\m/3n. Then each experiment in round 
(4) independently hits an edge of G with probability p = \Eg(C,L)\/\C\\L\. The expected number 
of hits is at ps > 1001ogn/3. By a Chernoff bound, the probability of getting fewer than 3/4 the 
expected number of hits is less than l/20n. But if we get at least 3/4 the desired number of hits, 
sc is correctly set to "dense". Similarly if \Eg(C,L)\ < \C\m/3n, we get an expected number 
of no more than ps < 101ogn/6 hits, and the probability of getting more than 3/2 the expected 
number of hits is less than l/20n. But if we get at most 3/2 the expected number of hits then sc 
is correctly set to "sparse" . Taking a union bound over all C (there are at most n of them) we get 
the desired result. □ 

To prove Claim we need to argue that every query in G' can be emulated by 0(n) queries in 
G. We reason like this. On query (v,i), we take time 0(n) to list all the neighbors of v in G. If 
the ith neighbor happens to be in L (step 2 in algorithm Reduce), we have answered the query in 
0(n) time. Otherwise, if sc is uninitialized, we take another 0(n) time to initialize it. So if C is 
sparse, we answer the query in 0(n) time. We now come to the more interesting case where C is 
dense. 

Consider what happens when we query an edge in the jth matching in some dense component 
C. If this edge was queried previously, we are done in time 0(n). But otherwise, we have to find 
a random edge between C and L previously unvisited in the jth matching. The more edges we 
see in the jth matching, the harder the new ones become to find. This is essentially the coupon 
collecting problem, so we expect to visit the edges in the jth matching in amortized time O(logn). 
Since it takes extra time 0(n) to just locate a single random edge between C and L, we get the 
desired amortized query complexity. 

Proof of Claim\^ By Claim[9]with probability 19/20 if s c = "dense", it must be that \E(C,L)\ > 
\C\m/6n. We assume that this is the case. Let Tj denote the time taken to answer the ith query, 
1 < % < q. The quantities Tj are random variables and we will bound the expectation of their sum. 

Without loss of generality we will assume that no queries are repeated. Also, if the jth neighbor 
of v was already determined to be the j'th neighbor of u, we will assume the query (u,f) is not 
repeated. In all these cases the answer can clearly be determined in 0(n) time (and even much 
less). 

We now group the queries into a set R and sets Sc,j, where R are queries within L or between 
L and sparse components of C and Sc j are queries that involve the jth matching inside dense 
component C. By linearity of expectation, 

E[T 1 + ... + T q ] = ^2E[T k ] + Yl E E ™- 

keR C,j k&S c ,j 

As we argued above, each entry in the first some is bounded by Kn 2 log n/m for some constant K. 
We now consider the entries in the second sum. Fix C and j and let k\ < • • • < kt be the order 
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of the queries in Sc,j- Let e = \E(C,L)\. Since the queries do not repeat, t < e/2. In order to 
answer query ki + \ (once k\, . . . , ki have been answered), we need to locate an edge between C and L 
which is not already matched in the jth matching. The number of already matched edges in the jth. 
matching is 2i. This leaves e — 2i good edges between C and L that can be used for the jth matching 
out of v. The probability that such an edge is hit by a random pair (u G C, z G L) is (e — 2i)/\C\ \L\ . 
Therefore, the expected number of attempts before such a pair is hit is |C||L|/(e — 2i). 

It follows that the expected time E[T^J to perform the ith query is at most if|C||L|/(e — 2i), 
where K is some fixed constant. Adding all of these together we get that 

£ E[T k ] < K £ M < K J2 ^ = K\C\\L\ f: < 2K\C\\L\ loge, 

keScj »=i »=i «=i 

using the standard estimate of the harmonic series. Since C is dense, we have that 

^2 E[T k ] < 2K\L\ ■ — • loge = (9((n 2 log n/m) ■ t) 

because t < e/2. Using the assumption that m = Vt{n) and Markov's inequality, we obtain the 
desired amortized query complexity. □ 

To finish the proof of Claim [8] we additionally need to show that for each component C that 
was identified as dense, the simulated queries in step (6) are consistent with an expander graph on 
the set Z. To prove this, notice that algorithm Reduce implements Z by a collection of A random 
matchings. It is a well-known fact that such a collection of random matchings is an expander graph 
with high probability. In order to be able to take a union bound over all components C, we need 
to make the probability of each cut large enough in n, and this is why we need to assume that 
A > 31ogn (as supposed to constant as is standard in such calculations). 

Proof sketch of Claim By Claim [9] the classification of components into sparse and dense is cor- 
rect with high probability, so it remains to argue that with high probability for all dense components 
C, all graphs constructed in step (6) are expanding. Let's fix such a dense component C and let Z 
be the corresponding graph. We will prove that the probability Z is not 1/2 expanding is at most 
l/20rt. The conclusion then follows by a union bound over all C. 

Let z = \Z\ and fix a partition (5, S) of Z with s = \S\ and s < z/2. We first consider the case 
s < z/4. Start choosing the edges in the first A/2 matchings going out of S one by one. At each 
point, the probability that the other endpoint of the edge lands inside S is at most 2s/ z. Taking a 
union bound over all S of size s, we get that the probability that fewer than s/2 edges land outside 
S is at most 

©■(^•©^•^(^(V^i'^^A^)-, 

using the inequality (?) < (en/k) k and the assumption s < z/4. Now we use the fact that 
A > 31ogn and conclude that the probability that some cut of size at most z/4 is violated is no 
more than 

z/4 

(16e 3 A/2 A ) 1 / 2 + ^(16e 3 A/2 A ) s / 2 < l/40n 

s=2 
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for sufficiently large n. Now suppose z/A < s < z/2. In this case, the expected number of edges in 
the cut (S, S) is As(z — s) > As/2. The events that an edge is in the cut are not independent, but 
they form a martingale, so we can apply Azuma's inequality (see for instance [BOT02] for a similar 
application of Azuma's inequality) to conclude that fewer than half of these edges make it into 
the cut with probability at most e _As//8 < e _Az / 32 < 2 _2 /40re, using the assumption A > 3 log re. 
Taking a union bound over all such cuts we get an extra contribution of l/40re, so the probability 
Z is not expanding is at most l/20re. This concludes the proof. □ 

5 From an XOR game to a bipartite graph 

We give a bounded degree reduction from XOR games to bipartiteness. Assume G is an XOR game 
with n vertices of maximum degree D. Consider the following graph G': 

1. G' has 2n vertices. Each vertex v of G is represented by a pair v,v'. 

2. For every 7^-edge (v,w)in G, we put the edges (v,w) and (v',w') in G' . 

3. For every =-edge (v,w) in G, we put the edges (v,w') and (v',w) in G'. 
A. For every vertex v in G, we put in D parallel edges (v,v') in G 1 . 

Notice that G' is a graph on 2n vertices with maximum degree 2D. Moreover, given oracle 
access to G, a query in G' can be implemented in constant time (in the bounded degree model). 

Claim 10. Assume G is m-removed from satisfiable. Then G' is 2m-removed from bipartite. 

Proof. We show the contrapositive. Assume G' is 2m-close to bipartite. Consider the optimum 
partition of G' . We think of it as a disjoint union of two partitions (SLIT, SUT), where (S, S) is a 
partition of the ^-vertices, and (T, T) is a partition of the v' vertices. We will show that the better 
one of the partition (S, S) and (T, T) violates no more than m edges in G. 

Let e=(U) be the total number of =-edges of G with both endpoints in U. Let e=(U, V) be the 
total number of pairs (u, v) where u G U , v £ V , and (u, v) is n =-edge of G. (We allow for U and 
V to intersect.) We define e^(U) and e^(U,V) analogously. 

The number of constraints in G violated by both (S, S) and (T, T) in is then 

MS) + e^(S) + e = (S, S)) + (e^(T) + e + {T) + e=(T, T)) . 
On the other hand, the number of edges of G' violated by the partition (S U T, S U T) is 

MS) + e^(S) + e # (T) + e^(T)) + (e = (S,T) + e = (S,T)) +D\S®T\, 
where S © T is the symmetric difference of S and T. We now argue that 

e=(S,S) + e = (T,T) < e = (S,T) + e = (S \T) + D\S © T\. 

We have 

e=(S,S) = e = (S,Sr\T) + e = (S,Sr\T) < e = (S,T) + e=(S,S flT) 
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and by symmetry 

e=(T,T) < e = (T,S) + e = (T,Tr\S). 

Since S (IT and T n S are a disjoint partition of S © T, we get 

e= (5,5nT) + e = (T,fn5) < £>|S©T| 

which proves the desired inequality. It follows that together the partitions (S, S) and (T, T) violate 
at most 2m edges of G, so one of them will violate at most m edges of G. □ 

6 Proof of the main theorem 

We now describe how the above steps combine into an algorithm for testing bipartiteness. Let K 
be the constant from the Goldreich-Ron algorithm. On input a graph: 

1. Choose a random subset of 0(1 /e 2 ) vertices of and denote the induced subgraph on those 
vertices by G. Let n be the number of vertices of G and set m = n/(logn) c where c is a 
sufficiently large constant. Set d = n 3 / 2 ^ +1 ) and A = 31ogn. 

2. Run algorithm Degree on G. 

3. Run algorithm High-degree on G. If the algorithm produces an odd-length cycle, reject. 
Otherwise, let F be any maximal subforest of the queries. 

4. Let G\ be the (oracle) XOR game obtained from G and the forest F using the implementation 
in Section HI 

5. Let G2 be the (oracle) graph obtained from G\ using the XOR-to-bipartiteness reduction in 
Section [5j 

6. Run the Goldreich-Ron bipartiteness algorithm on G2 and return its answer. 

Completeness analysis The completeness of this algorithm follows by construction: Any bi- 
partition of the original graph carries over to the graphs G\ , G2 and G3 , so if the original graph is 
bipartite the algorithm will always say so. 

Soundness analysis We now argue soundness. We assume the original graph is e-far from 
bipartite. By Conjecture [IJ with probability 9/10, G is m-removed from bipartite. If this is the 
case, then by Claim [5j with probability 9/10, unless the algorithm has found an odd cycle in step 2, 
there exists a subgraph G' of G obtained by removing at most to/2 edges such that F is a spanning 
forest of H in G' and G 1 restricted to H is bipartite. 

The set F could contain edges that are not in G'; add these to G' to obtain a graph G" . Then 
F remains a spanning forest of G" and G" restricted to H is still bipartite (any bicoloring induced 
by F is good for both G' H and G'^). Moreover, G" H is still m/2-removed from bipartite (since we 
only added edges to it). 

By Claim [HJ with probability 9/10, G\ is then a sparse version of G" . By Claim if this is 
the case, then G\ is m/6-removed from bipartite. Finally, by Claim [TU1 G2 is m/3-removed from 
bipartite. Then the Goldreich-Ron algorithm will detect an odd cycle with probability 9/10. Taking 
a union bound over all failure events we get that an odd-length cycle is found with probability at 
least 7/10. 
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Running time analysis The running time analysis is straightforward. Steps 1, 2, and 3 all run 
in time 0(n 2 /d). For step 4, by Claim [7] with probability 9/10 the running time of the remaining 
part is 0(n) ■ q, where q is the number of queries performed by the Goldreich-Ron algorithm. 
Notice that the maximum degree of vertices in G2 is 2Ad < 6dlogn, so this algorithm will take 
time 0(n c ), where c = 2 — 3/2(k + 1). This is a constant strictly less than two. This running time is 
achieved only with probability 9/10. By stopping the algorithm after running for this many steps 
and declaring the graph is bipartite, we get an algorithm that always runs in time 0(n c ) accepts 
all bipartite graphs, and rejects non-bipartite ones with probability at least 3/5. 

7 Proof of the conjecture for regular graphs 

We prove that Conjecture Q] is true when the graph is (i-regular for any d. The proof can easily be 
extended to almost-regular graphs, where the degree of every vertex is between d and Kd for an 
arbitrary constant K, but we won't do so for simplicity. 

Theorem 11. Assume G is e-far from bipartite and d-regular. Let S be a random subset of 
0((log(l/e)) 2 /e) vertices ofG. With probability 9/10, S is £l(e) -far from bipartite. 

Let d = ken. We will think of the set S as consisting of several parts, namely S = S\U- ■ -U5 S UT, 
where s = 8/clog(l/e). The set Si is chosen by putting each element in it indepenently at random 
with probability 10 log(l/e)/ ken. The set T is chosen by putting each element in it independently 
at random with probability p = 1001og(l/e)/en. 

Say the pair (S*,T) is violating if for every partition of S* U T, either T contains log(l/e)/e 
monochromatic edges, or there are at least 1/ke monochromatic edges between T and S*. 

Claim 12. Assume G is e-far from bipartite and S* dominates all but e/4 fraction vertices of G. 
Then the pair (S*,T) is violating with probability at least 1/2 over the choice of T . 

Proof. Fix a partition of S* and consider the induced partition on those vertices in G dominated by 
S*. (That is, we color every such vertex in G using the opposite color of its lexicographically smallest 
neighbor in S*.) We now upper bound the probability that among those vertices of T dominated by 
S* , there are fewer than 51og(l/e)/£ monochromatic edges with respect to the induced partition. 

Since S{ fails to dominate only e/4 fraction of vertices, there must be m > e(^)/2 violating edges 
among those vertices of G dominated by Si with respect to the induced partition. The expected 
number of such violating edges that make it inside T is then at least p 2 m. By an application 
of Janson's tail inequality (see for example [AS08]), we find that the probability that fewer than 
p 2 m/2 > 51og(l/e)/e monochromatic edges make it inside T is at most e~ pm ^ d = e ~ 41o s(V E )/ fe£ . 

By a union bound, the probability that there exists a partition of S* such that the induced 
partition on T has fewer than 51og(l/e)/e violating edges is at most 1/8. By another large devi- 
ation bound (Chernoff + union bounds), the probability that T has a vertex of degree more than 
4felog(l/e) is at most 1/8, so with probability at least 1/4 neither of these events hold. 

Now we argue that if neither event holds then (S*,T) is a violating pair. Suppose not. Then 
there is a partition of S* U T where T contains fewer than log(l/e)/e monochromatic edges and 
there are fewer than 1/ke monochromatic edges between T and Si. If we now change the color 
of the vertices in the partition of T that are inconsistent with Si we have introduced at most 
(1/ke) ■ (4k log(l/e)) new violating edges, by the bound on the degree. So we obtain a partition of 
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SiUT with fewer than log(l/e)/e + 41og(l/e)/e < 51og(l/e)/e monochromatic edges, contradicting 
the first assumption. □ 



Now let S* be a set chosen from the same distribution as Si. Since S* dominates all but an 
e/4- fraction of vertices with probability at least 3/4, we get that 

Pt s *,t[{S*,T) is violating] > 1/2 

and 

Pr T [Pr 5 »[(,S*,T) is violating] > 1/4] > 1/4. 
Fix a T such that the inner probability is at least 1/4. By Chebyshev's inequality, 

P r Si,...,5 s [at least s/8 pairs (Si,T) are violating] > 7/8 

So by a union bound, 

P r T,5i,...,5 s [at least s/8 pairs (Si,T) are violating] > 1/8. 

We now show that if there are s/8 such violating pairs, S must be log(l/e)/e-removed from bipartite. 
Consider an arbitrary partition of S. If T contains \og(l/e)/e monochromatic edges then we are 
done. Otherwise, There are at least 1/ke monochromatic edges between T and s/8 of the sets Si. 
This also gives s/8ke = log(l/e)/e monochromatic edges. 

To amplify the success probability from 1/8 to 9/10 we take 20 random disjoint copies of S. 
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